A New Multi-modal Technique for Bib Number/Text Detection in Natural Images

被引:4
|
作者
Roy, Sangheeta [1 ]
Shivakumara, Palaiahnakote [1 ]
Mondal, Prabir [2 ]
Raghavendra, R. [3 ]
Pal, Umapada [2 ]
Lu, Tong [4 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
[2] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata, India
[3] Gjovik Univ Coll, Norwegian Biometr Lab, Gjovik, Norway
[4] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210008, Jiangsu, Peoples R China
关键词
Face detection; Skin detection; Text detection; Multi-modal text detection; Bib number detection; Bib number recognition; SCENE TEXT DETECTION; LICENSE PLATES; BINARIZATION; RECOGNITION; FRAMEWORK; FEATURES;
D O I
10.1007/978-3-319-24075-6_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The detection and recognition of racing bib number/text, which is printed on paper, cardboard tag, or t-shirt in natural images in marathon, race and sports, is challenging due to person movement, non-rigid surface, distortion by non-illumination, severe occlusions, orientation variations etc. In this paper, we present a multi-modal technique that combines both biometric and textual features to achieve good results for bib number/text detection. We explore face and skin features in a new way for identifying text candidate regions from input natural images. For each text candidate region, we propose to use text detection and recognition methods for detecting and recognizing bib numbers/texts, respectively. To validate the usefulness of the proposed multi-modal technique, we conduct text detection and recognition experiments before text candidate region detection and after text candidate region detection in terms of recall, precision and f-measure. Experimental results show that the proposed multi-modal technique outperforms the existing bib number detection method.
引用
收藏
页码:483 / 494
页数:12
相关论文
共 50 条
  • [21] Recognition of camellia oleifera fruits in natural environment using multi-modal images
    Zhou H.
    Jin S.
    Zhou L.
    Guo Z.
    Sun M.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2023, 39 (10): : 175 - 182
  • [22] Flood Detection Using Multi-Modal and Multi-Temporal Images: A Comparative Study
    Islam, Kazi Aminul
    Uddin, Mohammad Shahab
    Kwan, Chiman
    Li, Jiang
    REMOTE SENSING, 2020, 12 (15)
  • [23] Multi-text multi-modal reading processes and comprehension
    Cromley, Jennifer G.
    Kunze, Andrea J.
    Dane, Aygul Parpucu
    LEARNING AND INSTRUCTION, 2021, 71
  • [24] Multi-modal browsing of images in Web documents
    Chen, F
    Gargi, U
    Niles, L
    Schütze, H
    DOCUMENT RECOGNITION AND RETRIEVAL VI, 1999, 3651 : 122 - 133
  • [25] Multi-modal pedestrian detection with misalignment based on modal-wise regression and multi-modal IoU
    Wanchaitanawong, Napat
    Tanaka, Masayuki
    Shibata, Takashi
    Okutomi, Masatoshi
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
  • [26] Is Multi-Modal Necessarily Better? Robustness Evaluation of Multi-Modal Fake News Detection
    Chen, Jinyin
    Jia, Chengyu
    Zheng, Haibin
    Chen, Ruoxi
    Fu, Chenbo
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (06): : 3144 - 3158
  • [27] Multi-Modal Detection of Man-Made Objects in Simulated Aerial Images
    Baran, Matthew S.
    Tutwiler, Richard L.
    Natale, Donald J.
    Bassett, Michael S.
    Harner, Matthew P.
    ALGORITHMS AND TECHNOLOGIES FOR MULTISPECTRAL, HYPERSPECTRAL, AND ULTRASPECTRAL IMAGERY XIX, 2013, 8743
  • [28] CONCEPT DETECTION IN LONGITUDINAL BRAIN MR IMAGES USING MULTI-MODAL CUES
    Caban, Jesus J.
    Lee, Noah
    Ebadollahi, Shahram
    Laine, Andrew E.
    Kender, John R.
    2009 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, VOLS 1 AND 2, 2009, : 418 - +
  • [29] Multi-modal Detection of Cyberbullying on Twitter
    Qiu, Jiabao
    Moh, Melody
    Moh, Teng-Sheng
    ACMSE 2022: PROCEEDINGS OF THE 2022 ACM SOUTHEAST CONFERENCE, 2022, : 9 - 16
  • [30] UNSUPERVISED BUILDING CHANGE DETECTION IN MULTI-MODAL SAR IMAGES USING CYCLEGAN
    Bergamasco, Luca
    Bovolo, Francesca
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 483 - 486