A New Multi-modal Technique for Bib Number/Text Detection in Natural Images

被引:4
|
作者
Roy, Sangheeta [1 ]
Shivakumara, Palaiahnakote [1 ]
Mondal, Prabir [2 ]
Raghavendra, R. [3 ]
Pal, Umapada [2 ]
Lu, Tong [4 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
[2] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata, India
[3] Gjovik Univ Coll, Norwegian Biometr Lab, Gjovik, Norway
[4] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210008, Jiangsu, Peoples R China
关键词
Face detection; Skin detection; Text detection; Multi-modal text detection; Bib number detection; Bib number recognition; SCENE TEXT DETECTION; LICENSE PLATES; BINARIZATION; RECOGNITION; FRAMEWORK; FEATURES;
D O I
10.1007/978-3-319-24075-6_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The detection and recognition of racing bib number/text, which is printed on paper, cardboard tag, or t-shirt in natural images in marathon, race and sports, is challenging due to person movement, non-rigid surface, distortion by non-illumination, severe occlusions, orientation variations etc. In this paper, we present a multi-modal technique that combines both biometric and textual features to achieve good results for bib number/text detection. We explore face and skin features in a new way for identifying text candidate regions from input natural images. For each text candidate region, we propose to use text detection and recognition methods for detecting and recognizing bib numbers/texts, respectively. To validate the usefulness of the proposed multi-modal technique, we conduct text detection and recognition experiments before text candidate region detection and after text candidate region detection in terms of recall, precision and f-measure. Experimental results show that the proposed multi-modal technique outperforms the existing bib number detection method.
引用
收藏
页码:483 / 494
页数:12
相关论文
共 50 条
  • [1] A new multi-modal approach to bib number/text detection and recognition in Marathon images
    Shivakumara, Palaiahnakote
    Raghavendra, R.
    Qin, Longfei
    Raja, Kiran B.
    Lu, Tong
    Pal, Umapada
    PATTERN RECOGNITION, 2017, 61 : 479 - 491
  • [2] Multi-modal and Multi-spectral Registration for Natural Images
    Shen, Xiaoyong
    Xu, Li
    Zhang, Qi
    Jia, Jiaya
    COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 309 - 324
  • [3] Multi-modal text recognition and encryption in scanned document images
    Maemoona Kayani
    Abdul Ghafoor
    M. Mohsin Riaz
    The Journal of Supercomputing, 2023, 79 : 7916 - 7936
  • [4] On the Effectiveness of Images in Multi-modal Text Classification: An Annotation Study
    Ma, Chunpeng
    Shen, Aili
    Yoshikawa, Hiyori
    Iwakura, Tomoya
    Beck, Daniel
    Baldwin, Timothy
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (03)
  • [5] Multi-modal text recognition and encryption in scanned document images
    Kayani, Maemoona
    Ghafoor, Abdul
    Riaz, M. Mohsin
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (07): : 7916 - 7936
  • [6] A new technique for sampling multi-modal distributions
    Abraham, KJ
    Haines, LM
    JOURNAL OF COMPUTATIONAL PHYSICS, 1999, 155 (02) : 380 - 386
  • [7] Multi-modal Fusion Network for Rumor Detection with Texts and Images
    Li, Boqun
    Qian, Zhong
    Li, Peifeng
    Zhu, Qiaoming
    MULTIMEDIA MODELING (MMM 2022), PT I, 2022, 13141 : 15 - 27
  • [8] Object detection in multi-modal images using genetic programming
    Bhanu, B
    Lin, YQ
    APPLIED SOFT COMPUTING, 2004, 4 (02) : 175 - 201
  • [9] Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking
    Wang, Tan
    Xu, Xing
    Yang, Yang
    Hanjalic, Alan
    Shen, Heng Tao
    Song, Jingkuan
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 12 - 20
  • [10] Application of Multi-Modal Neural Networks in Verifying the Authenticity of News Text and Images
    Li, Feng
    Xu, Meiling
    Rosli, Marshima Mohd
    TRAITEMENT DU SIGNAL, 2023, 40 (06) : 2397 - 2407