Contextual Text Block Detection Towards Scene Text Understanding

被引:2
|
作者
Xue, Chuhui [1 ,2 ]
Huang, Jiaxing [1 ]
Zhang, Wenqing [2 ]
Lu, Shijian [1 ]
Wang, Changhu [2 ]
Bai, Song [2 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] ByteDance Inc, Singapore, Singapore
来源
关键词
Scene text detection; RECOGNITION;
D O I
10.1007/978-3-031-19815-1_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing scene text detectors focus on detecting characters or words that only capture partial text messages due to missing contextual information. For a better understanding of text in scenes, it is more desired to detect contextual text blocks (CTBs) which consist of one or multiple integral text units (e.g., characters, words, or phrases) in natural reading order and transmit certain complete text messages. This paper presents contextual text detection, a new setup that detects CTBs for better understanding of texts in scenes. We formulate the new setup by a dual detection task which first detects integral text units and then groups them into a CTB. To this end, we design a novel scene text clustering technique that treats integral text units as tokens and groups them (belonging to the same CTB) into an ordered token sequence. In addition, we create two datasets SCUT-CTW-Context and ReCTS-Context to facilitate future research, where each CTB is well annotated by an ordered sequence of integral text units. Further, we introduce three metrics that measure contextual text detection in local accuracy, continuity, and global accuracy. Extensive experiments show that our method accurately detects CTBs which effectively facilitates downstream tasks such as text classification and translation. The project is available at https://sg-vilab.github.io/publication/xue2022contextual/.
引用
收藏
页码:374 / 391
页数:18
相关论文
共 50 条
  • [1] Towards a new approach of an automatic and contextual detection of meaning in text
    Fadili, Hammou
    [J]. 2013 ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2013,
  • [2] TCATD: Text Contour Attention for Scene Text Detection
    Hu, ZiLing
    Wu, Xingjiao
    Yang, Jing
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1083 - 1088
  • [3] Text Search: Towards Fast Text Localization in Scene Images
    Yang, Lei
    Cheng, Samuel
    Verma, Pramode K.
    Wang, Shuang
    [J]. PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 83 - 86
  • [4] Deep Residual Text Detection Network for Scene Text
    Zhu, Xiangyu
    Jiang, Yingying
    Yang, Shuli
    Wang, Xiaobing
    Li, Wei
    Fu, Pei
    Wang, Hua
    Luo, Zhenbo
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 807 - 812
  • [5] Max-Pooling based Scene Text Proposal for Scene Text Detection
    Dinh Nguyen Van
    Lu, Shijian
    Bai, Xiang
    Ouarti, Nizar
    Mokhtari, Mounir
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1295 - 1300
  • [6] AT-Text: Assembling Text Components for Efficient Dense Scene Text Detection
    Li, Haiyan
    Lu, Hongtao
    [J]. FUTURE INTERNET, 2020, 12 (11): : 1 - 14
  • [7] Scene Text Detection Based on Text Probability and Pruning Algorithm
    Zhou, Gang
    Liu, Yajun
    Shi, Fei
    Hu, Ying
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 726 - 735
  • [8] Text Attention and Focal Negative Loss for Scene Text Detection
    Huang, Randong
    Xu, Bo
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [9] Scene Text Detection with Inception Text Proposal Generation Module
    Zhang, Hang
    Liu, Jiahang
    Chen, Tieqiao
    [J]. ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 456 - 460
  • [10] Towards Accurate Scene Text Detection with Bidirectional Feature Pyramid Network
    Cao, Dongping
    Dang, Jiachen
    Zhong, Yong
    [J]. SYMMETRY-BASEL, 2021, 13 (03):