Human-like cognition: visual features grouping for hard-to-group text dataset

Cited by: 0
Authors
Li, Xin [1]
Liu, Hangyuan [1]
Tao, Chunfeng [2]
Han, Ruiyi [1]
Yang, Shumin [1]
Affiliations
[1] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao, Peoples R China
[2] Bur Geophys Prospecting Inc BGPCNPC, Zhuozhou, Peoples R China
Keywords
scene text spotting; visual features grouping; text correction
DOI
10.1117/1.JEI.33.2.023002
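Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract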
Most existing arbitrary-shape text detection methods group text instances using connected components and text center lines, which assumes that texts in adjacent positions belong to the same instance. However, many hard-to-group scene texts are too complex to be processed effectively in this way. To address this challenge, we propose a novel scene text spotting method that uses feature-based clustering inspired by human cognitive principles of text perception. First, a character spotter obtains the location and transcription of each character. Then, a lightweight recognition network extracts the visual features of the characters at their locations. These visual features are grouped into instances by a K-means-fuzzy-net, which explicitly models visual feature similarity to effectively group nested text, large-margin text, continuous text, and text with overlapping characters. Finally, the recognition results of the text instances are processed by a word correction module to improve overall accuracy and reduce vulnerability to errors in individual character detection. Additionally, we contribute a hard-to-group text dataset. Experiments demonstrate the state-of-the-art performance of our method in these hard-to-group scenarios. The hard-to-group text dataset is available at: https://github.com/baggio321/Hard-to-Group-Text-Dataset. © 2024 SPIE and IS&T
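The grouping-then-correction idea in the abstract can be illustrated with a minimal sketch. It is not the authors' K-means-fuzzy-net: plain k-means stands in for it, and the two-dimensional feature vectors, the character positions, the lexicon, and the names `kmeans`, `chars`, and `LEXICON` are all hypothetical values chosen for illustration. Two spatially interleaved words are separated by visual similarity rather than adjacency, and a misrecognized character is then repaired by a lexicon lookup.

```python
from difflib import get_close_matches
from math import dist


def kmeans(points, k, iters=20):
    """Plain Lloyd's k-means with deterministic farthest-point initialization."""
    centers = [points[0]]
    while len(centers) < k:
        # Next center: the point farthest from all centers chosen so far.
        centers.append(max(points, key=lambda p: min(dist(p, c) for c in centers)))
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign every point to its nearest center.
        labels = [min(range(k), key=lambda c: dist(p, centers[c])) for p in points]
        # Move every center to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return labels


# Hypothetical character spotter output: (transcription, x position, visual feature).
# Two words are interleaved along x, so adjacency-based grouping would mix them.
# "M" simulates a misrecognized "N".
chars = [
    ("O", 0, (0.90, 0.10)), ("c", 1, (0.12, 0.80)),
    ("P", 2, (0.88, 0.12)), ("a", 3, (0.10, 0.82)),
    ("E", 4, (0.91, 0.09)), ("f", 5, (0.11, 0.79)),
    ("M", 6, (0.89, 0.11)), ("e", 7, (0.09, 0.81)),
]
LEXICON = ["OPEN", "OVEN", "cafe"]  # assumed vocabulary for word correction

# Group characters by visual feature similarity, not by position.
labels = kmeans([feature for _, _, feature in chars], k=2)

words = []
for group in sorted(set(labels)):
    members = sorted((x, ch) for (ch, x, _), lab in zip(chars, labels) if lab == group)
    raw = "".join(ch for _, ch in members)  # reading order within the group
    match = get_close_matches(raw, LEXICON, n=1, cutoff=0.6)
    words.append(match[0] if match else raw)  # keep the raw string if nothing is close

print(words)
```

In this toy setup the number of groups is fixed at `k=2`; how the number of instances is determined in the actual method is not stated in the abstract.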
Pages: 18