Arbitrarily-oriented multi-lingual text detection in video

Cited by: 24
Authors
Khare, Vijeta [1]
Shivakumara, Palaiahnakote [2, 3]
Paramesran, Raveendran [1]
Blumenstein, Michael [4]
Affiliations
[1] Univ Malaya, Fac Engn, Kuala Lumpur, Malaysia
[2] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
[3] Univ Malaya, Comp Syst & Informat Technol, BS-18, Annex Bldg, Kuala Lumpur 50603, Malaysia
[4] Univ Technol Sydney, Sch Software, Sydney, NSW, Australia
Keywords
Higher order moments; Stroke width distance; Dynamic window; Caption text; Region growing; Arbitrarily-oriented text detection; Multi-lingual text detection; Scene text; Tracking; Wavelet
DOI
10.1007/s11042-016-3941-x
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Text detection in arbitrarily-oriented multi-lingual video is an emerging area of research because it plays a vital role in developing real-time indexing and retrieval systems. In this paper, we propose to explore moments for identifying text candidates. We introduce a novel idea for automatically determining windows from stroke width information, so that moments can be extracted for multi-font and multi-sized text in video. Temporal information is explored to find deviations between moving and non-moving pixels in successive frames iteratively, which results in static clusters containing caption text and dynamic clusters containing scene text as well as background pixels. The gradient directions of pixels in the static and dynamic clusters are analyzed to identify potential text candidates. Furthermore, a boundary growing step is proposed that expands the boundary of each potential text candidate until it finds neighboring components based on the nearest neighbor criterion. This process outputs the text lines appearing in the video. Experimental results on standard video data, namely ICDAR 2013, ICDAR 2015 and YVT videos, and on our own English and multi-lingual videos demonstrate that the proposed method outperforms the state-of-the-art methods.
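To make the temporal step in the abstract more concrete, the following is a minimal sketch of separating static from dynamic pixels by their deviation across successive frames. It is not the authors' algorithm: the abstract does not specify how the clusters are formed, so a single fixed threshold on the mean absolute frame-to-frame change is assumed here, and the names `temporal_deviation`, `split_static_dynamic` and `threshold` are hypothetical.

```python
from typing import List, Set, Tuple

Frame = List[List[float]]  # grayscale frame stored as rows of intensities
Pixel = Tuple[int, int]    # (row, column)


def temporal_deviation(frames: List[Frame]) -> Frame:
    """Return the mean absolute intensity change per pixel over successive frames."""
    if len(frames) < 2:
        raise ValueError("need at least two frames")
    height, width = len(frames[0]), len(frames[0][0])
    total = [[0.0] * width for _ in range(height)]
    # Accumulate |I_t - I_(t-1)| for every pair of consecutive frames.
    for prev, curr in zip(frames, frames[1:]):
        for y in range(height):
            for x in range(width):
                total[y][x] += abs(curr[y][x] - prev[y][x])
    pairs = len(frames) - 1
    return [[value / pairs for value in row] for row in total]


def split_static_dynamic(
    frames: List[Frame], threshold: float = 5.0
) -> Tuple[Set[Pixel], Set[Pixel]]:
    """Split pixels into a static set and a dynamic set.

    Assumption: a fixed threshold stands in for the iterative clustering
    mentioned in the abstract, whose details are not given there.
    """
    deviation = temporal_deviation(frames)
    static: Set[Pixel] = set()
    dynamic: Set[Pixel] = set()
    for y, row in enumerate(deviation):
        for x, value in enumerate(row):
            (static if value <= threshold else dynamic).add((y, x))
    return static, dynamic


# Three tiny 2x2 frames: the left column barely changes, the right column moves.
frames = [
    [[100, 0], [10, 200]],
    [[100, 50], [11, 100]],
    [[100, 100], [10, 0]],
]
static_pixels, dynamic_pixels = split_static_dynamic(frames)
```

Under this assumption, pixels in the static set would be candidates for caption text, while the dynamic set would still mix scene text with background and need the later gradient-direction analysis described in the abstract.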
Pages: 16625-16655
Page count: 31
Related papers
50 results in total
  • [1] Arbitrarily-oriented multi-lingual text detection in video
    Vijeta Khare
    Palaiahnakote Shivakumara
    Raveendran Paramesran
    Michael Blumenstein
    Multimedia Tools and Applications, 2017, 76 : 16625 - 16655
  • [2] Multi-Oriented and Multi-Lingual Scene Text Detection With Direct Regression
    He, Wenhao
    Zhang, Xu-Yao
    Yin, Fei
    Liu, Cheng-Lin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) : 5406 - 5419
  • [3] Multi-Lingual Text Recognition from Video Frames
    Sharma, Nabin
    Mandal, Ranju
    Sharma, Rabi
    Roy, Partha P.
    Pal, Umapada
    Blumenstein, Michael
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 951 - 955
  • [4] AON: Towards Arbitrarily-Oriented Text Recognition
    Cheng, Zhanzhan
    Xu, Yangliu
    Bai, Fan
    Niu, Yi
    Pu, Shiliang
    Zhou, Shuigeng
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5571 - 5579
  • [5] A New Method for Word Segmentation from Arbitrarily-Oriented Video Text Lines
    Sharma, Nabin
    Shivakumara, Palaiahnakote
    Pal, Umapada
    Blumenstein, Michael
    Tan, Chew Lim
    2012 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING TECHNIQUES AND APPLICATIONS (DICTA), 2012,
  • [6] Arbitrarily-Oriented Text Detection in Low Light Natural Scene Images
    Xue, Minglong
    Shivakumara, Palaiahnakote
    Zhang, Chao
    Xiao, Yao
    Lu, Tong
    Pal, Umapada
    Lopresti, Daniel
    Yang, Zhibo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2706 - 2720
  • [7] Multi-lingual scene text detection and language identification
    Saha, Shaswata
    Chakraborty, Neelotpal
    Kundu, Soumyadeep
    Paul, Sayantan
    Mollah, Ayatullah Faruk
    Basu, Subhadip
    Sarkar, Ram
    PATTERN RECOGNITION LETTERS, 2020, 138 : 16 - 22
  • [8] A New Wavelet-Laplacian Method for Arbitrarily-Oriented Character Segmentation in Video Text Lines
    Liang, Guozhu
    Shivakumara, Palaiahnakote
    Lu, Tong
    Tan, Chew Lim
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 926 - 930
  • [9] Baseline detection of multi-lingual unconstrained handwritten text lines
    Chakraborty, Dibyayan
    Pal, Umapada
    PATTERN RECOGNITION LETTERS, 2016, 74 : 74 - 81
  • [10] A New Laplacian Method for Arbitrarily-Oriented Word Segmentation in Video
    Shivakumara, P.
    Suhil, M.
    Guru, D. S.
    Tan, C. L.
    2014 11TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS 2014), 2014, : 339 - 343