Human-like cognition: visual features grouping for hard-to-group text dataset

被引:0
|
作者
Li, Xin [1 ]
Liu, Hangyuan [1 ]
Tao, Chunfeng [2 ]
Han, Ruiyi [1 ]
Yang, Shumin [1 ]
机构
[1] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao, Peoples R China
[2] Bur Geophys Prospecting Inc BGPCNPC, Zhuozhou, Peoples R China
关键词
scene text spotting; visual features grouping; text correction;
D O I
10.1117/1.JEI.33.2.023002
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Most existing arbitrary shape text detection methods employ connected components and text center lines for grouping text instances, which assume that texts in adjacent positions belong to the same instance. However, many hard-to-group scene texts are too complex to be effectively processed in this way. To address this challenge, we propose a novel scene text-spotting method that utilizes feature-based clustering inspired by human cognitive principles of text perception. Our approach involves first utilizing a character spotter to obtain the location and the transcription information of the characters. Then, a lightweight recognition network extracts the visual features of the characters by their locations. These visual features are then grouped into instances through a K-means-fuzzy-net, which explicitly model visual feature similarity to effectively group the nested text, the large-margin text, the continuous text, and the one with overlapping characters. Finally, the recognition results of text instances are processed by a word correction module to improve the overall accuracy and reduce the vulnerability of individual character detection. Additionally, we have contributed a hard-to-group text dataset. Experiments demonstrate the state-of-the-art performance of our method in addressing scenarios. Hard-to-group text dataset is available at: https://github.com/baggio321/Hard-to-Group-Text-Dataset. (c) 2024 SPIE and IS&T
引用
收藏
页数:18
相关论文
共 50 条
  • [41] 'They're creepy creatures with human-like features': children's experiences of visual hallucinations in Charles Bonnet syndrome-a qualitative study
    Jones, Lee
    Ditzel-Finn, Lara
    Mcdonald, Leanne
    Moosajee, Mariya
    ARCHIVES OF DISEASE IN CHILDHOOD, 2025,
  • [42] Motion planning for off-road autonomous driving based on human-like cognition and weight adaptation
    Wang, Yuchun
    Gong, Cheng
    Gong, Jianwei
    Jia, Peng
    JOURNAL OF FIELD ROBOTICS, 2024, 41 (06) : 1702 - 1723
  • [43] Toward Human-Like Social Robot Navigation: A Large-Scale, Multi-Modal, Social Human Navigation Dataset
    Nguyen, Duc M.
    Nazeri, Mohammad
    Payandeh, Amirreza
    Datar, Aniket
    Xiao, Xuesu
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7442 - 7447
  • [44] A Human-Like Siamese-Based Visual-Tactile Fusion Model for Object Recognition
    Wang, Fei
    Li, Yucheng
    Tao, Liangze
    Wu, Juan
    Huang, Gewen
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (03) : 850 - 863
  • [45] A human-like visual-attention-based artificial vision system for wildland firefighting assistance
    Madani, Kurosh
    Kachurka, Viachaslau
    Sabourin, Christophe
    Amarger, Veronique
    Golovko, Vladimir
    Rossi, Lucile
    APPLIED INTELLIGENCE, 2018, 48 (08) : 2157 - 2179
  • [46] A human-like visual-attention-based artificial vision system for wildland firefighting assistance
    Kurosh Madani
    Viachaslau Kachurka
    Christophe Sabourin
    Veronique Amarger
    Vladimir Golovko
    Lucile Rossi
    Applied Intelligence, 2018, 48 : 2157 - 2179
  • [47] HCNN: A Neural Network Model for Combining Local and Global Features Towards Human-Like Classification
    Zhang, Tielin
    Zeng, Yi
    Xu, Bo
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2016, 30 (01)
  • [48] Cognition-inspired behavioural feature identification and motion planning ways for human-like automated driving vehicles
    Xie, Shanshan
    Zheng, Jingyue
    Wang, Jianqiang
    IET INTELLIGENT TRANSPORT SYSTEMS, 2023, 17 (04) : 754 - 766
  • [49] Use of superordinate labels yields more robust and human-like visual representations in convolutional neural networks
    Ahn, Seoyoung
    Zelinsky, Gregory J.
    Lupyan, Gary
    JOURNAL OF VISION, 2021, 21 (13):
  • [50] Towards automatic visual fault detection in highly expressive human-like animatronic faces with soft skin
    Mayet, Ralf
    Diprose, James
    Pandey, Amit Kumar
    2019 28TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2019,