A New Multi-modal Technique for Bib Number/Text Detection in Natural Images

被引:4
|
作者
Roy, Sangheeta [1 ]
Shivakumara, Palaiahnakote [1 ]
Mondal, Prabir [2 ]
Raghavendra, R. [3 ]
Pal, Umapada [2 ]
Lu, Tong [4 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
[2] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata, India
[3] Gjovik Univ Coll, Norwegian Biometr Lab, Gjovik, Norway
[4] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210008, Jiangsu, Peoples R China
关键词
Face detection; Skin detection; Text detection; Multi-modal text detection; Bib number detection; Bib number recognition; SCENE TEXT DETECTION; LICENSE PLATES; BINARIZATION; RECOGNITION; FRAMEWORK; FEATURES;
D O I
10.1007/978-3-319-24075-6_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The detection and recognition of racing bib number/text, which is printed on paper, cardboard tag, or t-shirt in natural images in marathon, race and sports, is challenging due to person movement, non-rigid surface, distortion by non-illumination, severe occlusions, orientation variations etc. In this paper, we present a multi-modal technique that combines both biometric and textual features to achieve good results for bib number/text detection. We explore face and skin features in a new way for identifying text candidate regions from input natural images. For each text candidate region, we propose to use text detection and recognition methods for detecting and recognizing bib numbers/texts, respectively. To validate the usefulness of the proposed multi-modal technique, we conduct text detection and recognition experiments before text candidate region detection and after text candidate region detection in terms of recall, precision and f-measure. Experimental results show that the proposed multi-modal technique outperforms the existing bib number detection method.
引用
收藏
页码:483 / 494
页数:12
相关论文
共 50 条
  • [31] Multi-modal human aggression detection
    Kooij, J. F. P.
    Liem, M. C.
    Krijnders, J. D.
    Andringa, T. C.
    Gavrila, D. M.
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 144 : 106 - 120
  • [32] Multi-modal novelty and familiarity detection
    Christo Panchev
    BMC Neuroscience, 14 (Suppl 1)
  • [33] Deep Neural Architecture for Multi-Modal Retrieval based on Joint Embedding Space for Text and Images
    Balaneshin-kordan, Saeid
    Kotov, Alexander
    WSDM'18: PROCEEDINGS OF THE ELEVENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2018, : 28 - 36
  • [34] Multi-Modal Depression Detection and Estimation
    Yang, Le
    2019 8TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS (ACIIW), 2019, : 26 - 30
  • [35] A new technique for multi-modal 3D image registration
    Stippel, G
    Ellsmere, J
    Warfield, SK
    Wells, WM
    Philips, W
    BIOMEDICAL IMAGE REGISTRATION, 2003, 2717 : 244 - 253
  • [36] MIGT: Multi-modal image inpainting guided with text
    Li, Ailin
    Zhao, Lei
    Zuo, Zhiwen
    Wang, Zhizhong
    Xing, Wei
    Lu, Dongming
    NEUROCOMPUTING, 2023, 520 : 376 - 385
  • [37] StrucTexT: Structured Text Understanding with Multi-Modal Transformers
    Li, Yulin
    Qian, Yuxi
    Yu, Yuechen
    Qin, Xiameng
    Zhang, Chenquan
    Liu, Yan
    Yao, Kun
    Han, Junyu
    Liu, Jingtuo
    Ding, Errui
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1912 - 1920
  • [38] Image and Encoded Text Fusion for Multi-Modal Classification
    Gallo, I.
    Calefati, A.
    Nawaz, S.
    Janjua, M. K.
    2018 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2018, : 203 - 209
  • [39] VTLayout: A Multi-Modal Approach for Video Text Layout
    Zhao, Yuxuan
    Ma, Jin
    Qi, Zhongang
    Xie, Zehua
    Luo, Yu
    Kang, Qiusheng
    Shan, Ying
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2775 - 2784
  • [40] Text generation and multi-modal knowledge transfer for few-shot object detection
    Du, Yaoyang
    Liu, Fang
    Jiao, Licheng
    Li, Shuo
    Hao, Zehua
    Li, Pengfang
    Wang, Jiahao
    Wang, Hao
    Liu, Xu
    PATTERN RECOGNITION, 2025, 161