Arbitrarily-oriented multi-lingual text detection in video

被引:24
|
作者
Khare, Vijeta [1 ]
Shivakumara, Palaiahnakote [2 ,3 ]
Paramesran, Raveendran [1 ]
Blumenstein, Michael [4 ]
机构
[1] Univ Malaya, Fac Engn, Kuala Lumpur, Malaysia
[2] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
[3] Univ Malaya, Comp Syst & Informat Technol, BS-18,Annex Bldg, Kuala Lumpur 50603, Malaysia
[4] Univ Technol Sydney, Sch Software, Sydney, NSW, Australia
关键词
Higher order moments; Stroke width distance; dynamic window; Caption text; Region growing; Arbitrarily-oriented text detection; Multi-lingual text detection; SCENE TEXT; TRACKING; WAVELET;
D O I
10.1007/s11042-016-3941-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text detection in arbitrarily-oriented multi-lingual video is an emerging area of research because it plays a vital role for developing real-time indexing and retrieval systems. In this paper, we propose to explore moments for identifying text candidates. We introduce a novel idea for determining automatic windows to extract moments for tackling multi-font and multi-sized text in video based on stroke width information. The temporal information is explored to find deviations between moving and non-moving pixels in successive frames iteratively, which results in static clusters containing caption text and dynamic clusters containing scene text, as well as background pixels. The gradient directions of pixels in static and dynamic clusters are analyzed to identify the potential text candidates. Furthermore, boundary growing is proposed that expands the boundary of potential text candidates until it finds neighbor components based on the nearest neighbor criterion. This process outputs text lines appearing in the video. Experimental results on standard video data, namely, ICDAR 2013, ICDAR 2015, YVT videos and on our own English and Multi-lingual videos demonstrate that the proposed method outperforms the state-of-the-art methods.
引用
收藏
页码:16625 / 16655
页数:31
相关论文
共 50 条
  • [41] Firefighting in a multi-lingual world
    Anon
    Fire International, 2002, (194):
  • [42] The translation of multi-lingual cultures
    Shread, Carolyn
    TRANSLATION STUDIES, 2013, 6 (01) : 128 - 131
  • [43] A PAIR OF ARBITRARILY-ORIENTED COPLANAR CRACKS IN AN ANISOTROPIC ELASTIC SLAB
    ANG, WT
    JOURNAL OF THE AUSTRALIAN MATHEMATICAL SOCIETY SERIES B-APPLIED MATHEMATICS, 1991, 32 : 284 - 295
  • [44] A comprehensive review on detection of hate speech for multi-lingual data
    Narula, Rachna
    Chaudhary, Poonam
    SOCIAL NETWORK ANALYSIS AND MINING, 2025, 14 (01)
  • [45] Multi-lingual Text Clustering Method Using Bilingual Semantic Correspondence Analysis
    Luo Yuansheng
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1216 - 1221
  • [46] A NON-PARAMETRIC MULTI-LINGUAL CLUSTERING MODEL FOR TEMPORAL SHORT TEXT
    Kumar, Jay
    Kumar, Rajesh
    Ul Haq, Amin
    Shafiq, Sidra
    2020 17TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2020, : 58 - 61
  • [47] OUTLINEGEN: Multi-lingual Outline Generation for Encyclopedic Text in Low Resource Languages
    Subramanian, Shivansh
    Taunk, Dhaval
    Gupta, Manish
    Varma, Vasudeva
    SOCIAL NETWORKS ANALYSIS AND MINING, ASONAM 2024, PT II, 2025, 15212 : 149 - 159
  • [48] Self-Supervised Augmentation and Generation for Multi-lingual Text Advertisements at Bing
    Kou, Xiaoyu
    Zhao, Tianqi
    Zhang, Fan
    Li, Song
    Zhang, Qi
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3187 - 3196
  • [49] Concept based multi-lingual text retrieval in hybrid peer to peer network
    Li, Shaozi
    Chen, Qi'an
    Chen, Zhinxin
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 3812 - 3816
  • [50] A Low Resource Multi-lingual Simultaneous Script Identification and Text Recognition Model
    Jayati Mukherjee
    Utpal Roy
    SN Computer Science, 5 (6)